When an infant leaves the secure surroundings of the
mother’s womb to enter the mysterious world which 
awaits, the first attempt to communicate is made through 
crying. This vocal communication, as the child grows up, 
eventually results in the use of spoken language. Another 
means of communication used by the child is visual, 
through the use of gestures, pictures and the written word. 
In order to communicate effectively, both forms must be 
mastered. The use of voice, however, is the preferred 
medium and is practiced and refined throughout life. 


We have always been limited in our effective use of the 
computer by the extent of our ability to communicate with 
it. Input/output functions typically are the slowest aspects
of a computer; therefore, any method that speeds up our
ability to interact with a computer should be worth pursuing.
Much work has been done over the last 30 years in
developing computer-based speech communications. In
this report we'll examine four distinct areas of man-machine
speech communications: voice recognition, voice
synthesis, voice response, and voice store-and-forward.
We'll take a detailed look at the respective technologies,
applications for speech systems, planning considerations if
you want to add speech to your operations, and the future
of this exciting technology. At the end of the
report is a list of 35 speech vendors, followed by compari- 
son charts detailing 83 speech products. Datapro hopes this 
report will be a valuable tool in your planning efforts, 
particularly since there are so many different systems 
available. 


All About Speech Technology takes an in-depth look at
computer-based speech and features comparison charts
detailing 83 products from 35 manufacturers. Speech
technology encompasses four distinct areas: voice
recognition, voice synthesis, voice response, and voice
store-and-forward. In addition to a review of the individual
techniques, our report describes the current applications
for speech systems, previews some exciting new uses for
speech, and offers tips to assist you in planning for a
speech system.


SPEECH—A UNIQUELY HUMAN ABILITY 


Although most of the higher animals generate a wide 
variety of sounds they use for communicating, it is the 
human race that has taken this capability to its most 
sophisticated level. Not content with just a series of grunts, 
clicks, and other primitive noises, humans found they 
could join these various sounds into more complex struc- 
tures, which evolved into speech as we know it today. What 
we basically take for granted is actually the product of a 
complex grouping of elements, without which it would be 
impossible to produce speech. The key element is the vocal
cords, which vibrate to produce what are referred to as
“voiced” sounds, typically vowels. The other key element in
speech is the vocal tract, which includes the mouth, tongue,
nasal cavity, and throat. All the other specialized sounds,
such as consonants and consonant groups (e.g., ch, sh, gr,
and st), are produced in the vocal tract. Add a healthy set
of lungs and you have the primary ingredients for speech. We
say primary because the production of speech is only the
first step in the communicative process. When we speak, we
generally don’t utter words at random, or in a relentless
monotone. Our speech is generally well-ordered, and
variations in speed, pitch, volume, and timbre give it
additional meaning.


Votan’s Voice Management System (VMS) offers voice I/O
support for many applications and includes four to eight
voice channels with telephone interface capabilities, a
Voice Control Unit (8086-based), and a MULTIBUS backplane.
Optional hardware includes 256K or 512K RAM boards, disk
controller boards, floppy disk drives, CRT terminals, and
communications boards.


CHART A. SPEECH RECOGNITION SYSTEM CONFIGURATION


The actual production of speech generates a complex series 
of rapidly changing pressure waveforms. Closer examina- 
tion of these waveforms reveals the basic building-block 
elements of speech, known as phonemes. Each language has 
its own set of phonemes, and speech is comprised of 
phoneme groups. Vowel and consonant sounds are, in fact, 
phonemes, and have many different characteristics, de- 
pending on which components of the human speech appa- 
ratus are used. 


Going a step beyond phonemes, we find that the position of 
a phoneme in a word has a significant effect on the ultimate 
sound generated. This implies the need for greater defini- 
tion of specific speech sounds, since on paper a word 
generally has a specific meaning. The same word spoken, 
however, can be perceived in different ways depending on 
how it’s said. Speech scientists have identified allophones
as “acoustic variants” of phonemes and use them to more
explicitly define how a particular word is formed. Allophones,
like phonemes, are as variable as languages them-
selves, and it’s not unusual for a basic phoneme to have one 
or more allophones derived from it—such is the complex- 
ity of speech. 


The basic components of words are morphemes, and gener- 
ally take the form of prefixes, roots, or suffixes. Most words 
are comprised of one or more morphemes, which are 
themselves based on allophones and phonemes. Without 
going into the mathematical aspects of speech—which are 
extensive—we can see that speech is indeed a complicated 
process. And yet it is the most efficient means of communi- 
cating ever devised. 


(Chart A box labels: Word Search, Decoding; System Training; Final)

SPEECH TECHNOLOGY—AN HISTORICAL 
PERSPECTIVE 


The Industrial Revolution confirmed our desire to produce
more products faster, more efficiently, and at lower cost. So
why not speech, also? The idea of artificially produced
speech and the ability of machines to recognize spoken 
words has fascinated researchers for many years. Mechani- 
cal speaking machines have been identified as far back as 
the eighteenth century. The earliest evidence of electrical 
speech synthesis appeared in the late 1930s with a device 
called the Voder. Developed by Homer Dudley at Bell 
Telephone Laboratories, the device modeled the human 
vocal tract via special filters. It led to the Vocoder, also by 
Dudley, which produced speech from coded electrical sig- 
nals. Numerous artificial speech systems have been devel- 
oped since then. 


Speech recognition systems have a more recent history, 
with most of the early development work done in the 
1950s. Curiously, the earliest example of a speech-con- 
trolled device appeared years before, in the form of a toy 
dog called “Radio Rex.” Whenever you spoke his name,
Rex would jump out of his house. Actually, as you might 
suspect, almost any sound resembling his name would get 
Rex to respond. Prior to the work done in the ’50s, a key
development which paved the way for speech recognition
was the sound spectrograph, developed in the 1930s and
’40s, that produced a graphic picture of the energy
characteristics of an utterance. The early recognition systems were
oriented more toward individual word understanding, 
while later developments in the 1960s and ’70s began 
focusing on connected speech. One of the most significant 
speech recognition efforts ever undertaken was the Speech 
Understanding Research (SUR) project of the Advanced 
Research Projects Agency (ARPA). Lasting from 1971 to
1976, the project pursued continuous speech recognition
using a defined vocabulary, and was largely successful in
meeting its objectives. Much of the work done in ARPA SUR and
other projects formed the nucleus of the speech recognition 
industry today. 


A natural offshoot of a speech synthesis system is a voice
response system. Probably the most familiar of these systems
is telephone intercept, in which a voice advises you if
a phone number has been changed or disconnected. These
systems date back about 20 years, and have been a main- 
stay of the telephone industry, particularly with improve- 
ments in computer technology. 


The idea of storing spoken words for subsequent retrieval 
dates back to the phonograph and, later, to magnetic tape. 
What separates the current concept of voice store-and- 
forward from its predecessors is the method of operation. 
Sophisticated computers store speech in digital form, 
which can then be accessed by one or more recipients, all 
with far greater ease and efficiency than ever before. The 
first of these systems appeared in the late 1970s and was 
developed by ECS Telecommunications, Inc. (now VMX, 
Inc.). 


THE KEY AREAS WE COVER 


As you can see, many years of effort have been invested in 
the speech technology industry. Some areas, such as speech 
synthesis and voice response, are well established, with 
diverse product lines. Speech recognition, which you will 
see is more complex than other technologies, has also had 
its share of successes, although certain aspects, such as 
continuous speech recognition, still have quite a way to go 
before entering a real-world environment. Voice store-and- 
forward systems are gradually finding their way into busi- 
ness operations. 


SPEECH RECOGNITION 


Ever stop to think how good we are at recognizing and 
understanding speech? It’s another ability we take for 
granted, and we do it quite well. For a computer, however, 
speech recognition is a much more complicated process. It 
can be defined as the ability to take speech input, and 
respond with the desired output. This rather simplistic 
definition belies the enormous complexities inherent in 
speech recognition. The problem is further complicated by 
the diversity of human vocal capabilities. After all, it’s been 
shown that no two voices sound exactly alike, although 
leading impersonators would like you to believe otherwise. 
Add to this the elements of context and semantics, which 
imply additional meaning to what is said, and you have a 
real challenge. 


The majority of speech recognition systems attempt to 
recognize isolated words, and, by and large, do that quite 
successfully. Only a few systems can recognize groupings of 
words, or continuous speech, and are generally limited in 
their capabilities to only a few words per statement. Most
systems are also speaker-dependent, being programmed to
respond to specific voice patterns embedded in memory.
Much more difficult to produce is speaker-independent 
recognition, which means the system must transcend the 
speech boundaries of a few known people and be able to 
understand many unknown ones. Systems must also be 
able to screen out extraneous noises that could complicate 
the recognition process. 


Speech recognition systems are based on several distinct
components, as shown on Chart A. A word is spoken into a
microphone where it is converted to an electrical signal.
The signal is then processed through an analog-to-digital
converter and stored in the system. The signal is sampled
from 6,000 to 20,000 times per second and each sample
contains between 6 and 16 bits. This data is then analyzed
for specific acoustic properties that will identify the
individual speech components. Several techniques are used in
this process, such as zero-crossing analysis, the fast
Fourier transform (FFT), and linear predictive coding (LPC).
Once the signal has
been analyzed, the system must determine where distinct 
acoustic regions, or segments, occur. The segmentation 
process is designed to locate the boundaries between words, 
so that the system will need less computational power when 
it comes time to match the signal with a reference template. 
Once a discrete speech signal is identified, it is then com- 
pared against a reference template library for a possible 
match. The systems must have templates programmed into 
memory. This is how the system is “trained.” Since the 
likelihood of a perfect match is slim, many templates are 
programmed, giving the system a better chance to find a 
match. If a match occurs, the system will perform the 
desired action, such as printing what was spoken. In isolat- 
ed-word systems, this process generally produces a high 
recognition rate, typically above 95 percent. In continuous 
speech recognition systems, the signal must be matched not 
only against a template, but also against a language model. 
This model describes the rules for the permitted vocabulary 
and the creation of word strings (syntax). As you might 
expect, language models for speech systems are not as 
complex as those we use every day, but that is generally due 
to the system’s limited processing power. 
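
The sampling figures above imply a wide range of raw data
rates, and zero-crossing analysis is simple enough to sketch.
The Python below is a minimal illustration only, not any
vendor's implementation; the names bit_rate, zero_crossings,
and tone are hypothetical and introduced just for this
example.

```python
import math


def bit_rate(samples_per_second, bits_per_sample):
    """Raw data rate, in bits per second, leaving the A/D converter."""
    return samples_per_second * bits_per_sample


def zero_crossings(samples):
    """Count sign changes between consecutive samples.

    A high count suggests high-frequency content (for example,
    hissing consonants); a low count suggests low-frequency,
    voiced sounds.
    """
    count = 0
    for previous, current in zip(samples, samples[1:]):
        if (previous < 0) != (current < 0):
            count += 1
    return count


def tone(frequency_hz, rate_hz, n_samples, phase=0.5):
    """Generate a test sine wave (a stand-in for real speech samples)."""
    return [
        math.sin(2 * math.pi * frequency_hz * i / rate_hz + phase)
        for i in range(n_samples)
    ]


low_end = bit_rate(6000, 6)      # slowest, coarsest case in the text
high_end = bit_rate(20000, 16)   # fastest, finest case in the text
```

At the low end of the quoted range the converter produces
36,000 bits per second; at the high end, 320,000. A
2,000-Hz test tone also yields far more zero crossings than a
200-Hz tone over the same interval, which is the property
the analysis relies on.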


Dynamic programming, which carries with it large compu- 
tational requirements, provides a way to compute many 
more possibilities for finding a template or language model 
match. It is more successful than any other algorithm in 
solving the continuous speech recognition problem. 
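
One well-known dynamic programming method for template
matching is dynamic time warping (DTW), which stretches or
compresses the time axis so that a slowly spoken word can
still line up with its template. The report does not say
which algorithm any given product uses, so treat the Python
below as an assumed, minimal sketch. Each utterance is
reduced to a short list of numbers (a hypothetical
one-value-per-frame feature), and the names dtw_distance and
recognize are invented for the example.

```python
def dtw_distance(utterance, template):
    """Cost of the best time alignment between two feature sequences."""
    infinity = float("inf")
    n, m = len(utterance), len(template)
    # cost[i][j] = best cost of aligning the first i utterance frames
    # with the first j template frames.
    cost = [[infinity] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(utterance[i - 1] - template[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # utterance frame repeats a template frame
                cost[i][j - 1],      # template frame repeats an utterance frame
                cost[i - 1][j - 1],  # both advance together
            )
    return cost[n][m]


def recognize(utterance, templates):
    """Return the vocabulary word whose template aligns most cheaply."""
    return min(templates, key=lambda word: dtw_distance(utterance, templates[word]))


# Hypothetical two-word vocabulary with made-up feature values.
TEMPLATES = {"yes": [1, 3, 5, 3, 1], "no": [5, 4, 3, 2, 1]}
```

A drawn-out "yes" such as [1, 1, 3, 5, 5, 3, 1] still aligns
with its template at zero cost, which is why the technique
tolerates variations in speaking rate. The table of partial
costs grows with the product of the two sequence lengths,
which hints at the large computational requirements noted
above.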


Automatic speech recognition has much industrial, mili- 
tary, and commercial potential, and many applications will 
develop as the technology improves and costs come down. 
Some of the most advanced speech recognition algorithms
have been developed by the U.S. National Security Agency.
The agency’s system focuses on certain words found in verbal
transmissions from “unfriendly” nations. Although the
system is not likely to be used for domestic surveillance, we 
should be aware of a possible threat to our privacy in 
coming years. 


Some of the key manufacturers of speech recognition sys- 
tems are Centigram Corporation, Interstate Electronics, 
NEC, Threshold Technology, Verbex Corporation, and 
Votan, Inc. The list is slowly growing, and Datapro encourages
you to keep a close eye on this important industry.


SPEECH SYNTHESIS, OR VOICE OUTPUT 


Artificial speech is a seasoned technology with many examples
already in existence. While most speech produced by
today’s technology still sounds distinctly nonhuman, the
day is not far off when you won’t be able to distinguish it 
from the real thing. 


Speech synthesizers use several different methods for con- 
structing speech, as detailed in Chart B. The most basic 
involves recording human-spoken words whose waveforms
are digitally encoded and stored. To produce a word
or phrase, the synthesis program retrieves the desired
words from memory, assembles them in the desired sequence,
and sends them to a digital-to-analog converter for
generating the voice signal. This method is fairly simple 
and inexpensive, although the reconstructed speech exhib- 
its little conformity to contextual or semantic rules. 
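
The waveform method just described can be sketched in a few
lines of Python. This is an illustrative assumption, not a
description of any product: the 8-bit sample size, the
function names encode, decode, and assemble, and the tiny
sample lists are all hypothetical.

```python
def encode(samples, bits=8):
    """Quantize samples in the range -1.0..1.0 to unsigned integer codes."""
    levels = (1 << bits) - 1
    return [round((s + 1.0) / 2.0 * levels) for s in samples]


def decode(codes, bits=8):
    """Turn stored codes back into sample values (the D/A step)."""
    levels = (1 << bits) - 1
    return [c / levels * 2.0 - 1.0 for c in codes]


def assemble(words, store, pause_codes=()):
    """Retrieve each stored word and join the codes in the requested order."""
    output = []
    for index, word in enumerate(words):
        if index:
            output.extend(pause_codes)  # optional silence between words
        output.extend(store[word])      # raises KeyError for unknown words
    return output


# Hypothetical stored vocabulary: each word is a list of encoded samples.
STORE = {
    "number": encode([0.0, 0.5, -0.5]),
    "changed": encode([1.0, -1.0]),
}
```

Because each word is replayed exactly as recorded, the
output cannot adjust pitch or stress to suit the
surrounding words, which is the limitation the paragraph
above points out.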


Another technique, similar to waveform encoding, is called 
formant synthesis. Words are spoken into the system, 
where they are analyzed and stored as time variations of 
vocal tract resonances, called formants. To construct 
speech, a special digital filter, under computer control, 
retrieves both voiced and nonvoiced speech components 
and assembles them in the correct sequence. This method 
produces much smoother flowing speech, and more atten- 
tion can be paid to speech characteristics such as pitch, 
duration, and intensity. Both this method and the previous 
one have been used for producing the familiar telephone 
intercept messages. 


Votrax uses digitally-encoded phonemes and allophones to 
construct speech. The method is very similar to formant 
synthesis, and has the capacity for an unlimited vocabu- 
lary. The input is broken down into phonemes; however, 
the resulting speech is synthesized largely from allophones, 
which can convey more contextual information than pho- 
nemes only. Allophones are produced by 12-bit control 
words acting on a preprogrammed phonemic synthesizer. 
Specific word characteristics (pitch, tone, volume, etc.) are 
determined by the control words as they are decoded into 
analog control signals. The system hardware stores the 
rules, or parameters, for synthesizing word forms. These 
parameters are then translated into speech sounds through 
an array of sound generators and filters. A variable-pitch 
generator produces voiced sounds, such as vowels, and a 
white-noise generator creates unvoiced sounds. 


Text-to-speech synthesis is a promising technology, similar 
to formant synthesis, in which typed text is the input 
medium. The system converts alphanumeric characters to 
speech, and also features an unlimited vocabulary. The text 
is compared with the system’s dictionary, and if a match is 
found, the phonetic information needed to construct the 
word is sent to a formant synthesizer, from which speech 
output occurs. The system dictionary can also include 
specific acoustic characteristics (pitch, volume, duration, 
etc.) that will produce a more natural sound. 
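
The dictionary lookup at the heart of text-to-speech can be
sketched as follows. The dictionary entries, the phoneme
symbols, and the function name text_to_phonemes are
hypothetical; the letter-by-letter fallback for unknown
words is also an assumption made for the example, since the
report does not describe how unmatched words are handled.

```python
# Hypothetical system dictionary: word -> phoneme symbols.
DICTIONARY = {
    "speech": ["S", "P", "IY", "CH"],
    "voice": ["V", "OY", "S"],
}


def text_to_phonemes(text, dictionary):
    """Convert typed text to the phoneme list a synthesizer would receive."""
    phonemes = []
    for raw_word in text.lower().split():
        word = raw_word.strip(".,;:!?")
        if not word:
            continue
        if word in dictionary:
            # Dictionary match: pass the stored phonetic information on.
            phonemes.extend(dictionary[word])
        else:
            # Assumed fallback: spell the word out letter by letter.
            phonemes.extend("<" + letter + ">" for letter in word)
    return phonemes
```

In this sketch, text_to_phonemes("Voice speech.", DICTIONARY)
returns the stored phonemes for both words in order, while an
unlisted word such as "TI" comes back as spelled-out
letters.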


Texas Instruments, a leading manufacturer of speech chips, 
uses linear predictive coding (LPC) techniques to simulate 
acoustic conditions found in the human vocal tract. A set of 
three IC chips performs the necessary calculations to gener- 
ate speech, based on complex software algorithms. 


The advent of integrated circuits has brought synthetic
speech within reach of most consumers. As ICs get more 
powerful, higher-quality speech should result, not to men- 
tion lower unit costs. Some of the major speech synthesis 
manufacturers include General Instrument, NEC, National 
Semiconductor, Texas Instruments, and Votrax. 


VOICE RESPONSE 


A logical application of voice synthesis technology is in the 
area of computer-based voice response systems. In a typical 
system, such as in Chart C, a user performs some sort of 
data entry function using a keyboard or Touch-Tone tele- 
phone. At the completion of the data entry action, the 
system replies with a specific response. A vocabulary of 
words and phrases is programmed into the system, using 
various technologies mentioned earlier. Upon receipt of a 
specific code, the synthesis program assembles the correct 
response from the vocabulary, where it is then converted to 
speech. The most familiar uses of voice response technol- 
ogy are in the banking industry, with a growing emphasis in 
electronic funds transfer (EFT) and personal banking. 
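
A bank-by-phone inquiry shows how a response is assembled
from a fixed vocabulary. The sketch below is hypothetical
throughout: the transaction codes, the phrase wording, the
account record, and the function names are assumptions for
illustration, and amounts are spoken digit by digit only to
keep the example short.

```python
DIGITS = ["zero", "one", "two", "three", "four",
          "five", "six", "seven", "eight", "nine"]


def speak_digits(number):
    """Map each digit of a number (or digit string) to a vocabulary word."""
    return [DIGITS[int(character)] for character in str(number)]


def build_response(code, account):
    """Assemble the vocabulary entries to be spoken for a Touch-Tone code."""
    if code == "1":  # assumed code for a balance inquiry
        dollars, cents = divmod(account["balance_cents"], 100)
        return (["your balance is"] + speak_digits(dollars)
                + ["dollars", "and"] + speak_digits(f"{cents:02d}")
                + ["cents"])
    # Any other code falls through to a stock error message.
    return ["invalid entry", "please try again"]
```

For an account holding $125.30, code "1" produces the
sequence "your balance is", "one", "two", "five", "dollars",
"and", "three", "zero", "cents". Each entry would then be
retrieved from the stored vocabulary and converted to
speech.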


Voice response systems are usually connected to telephone 
systems, since the telephone (particularly with Touch- 
Tone) is a natural data entry device. It’s rather easy to 
connect a phone system into your company’s computer; 
from that point, add voice response and you’re in business. 
The voice system will have a number of communications 
ports that connect to similar ports on the phone system. 
Datapro recommends that you work closely with both your 
voice response and phone system vendors to maximize the 
effectiveness of your systems. Voice response manufactur- 
ers include Cognitronics Corporation, IBM, Periphonics 
Corporation and VMX, Inc. 


VOICE STORE-AND-FORWARD 


How many times have you tried to call someone, only to 
find that person unavailable? Trying to return your call, the 
person finds you out of the office. You attempt a return 
call—no luck—and on it goes. It’s known as “telephone 
tag,” and we are all familiar with it. It wastes our time, 
causes frustration, and reduces our productivity. Wouldn’t 
it be great to leave your message—voice message—with 
your party so that you know it will be received? 


Most of us are probably familiar with telephone answering 
machines. We can leave our spoken message with the 
system for future retrieval. We know our message will be 
received, but now the recipient has to call us back! This is 
where the concept of “voice mail” enters the picture. Voice 
mail systems are computer-based switching systems that 
record and transmit voice messages. The voice input is 
digitized and stored on disk for further processing. After 
entering your message, you can edit it and then instruct the 
system to route the message to one or more “voice mail- 
boxes.” Your recipients can hear a list of incoming mes- 
sages and selectively play them back, skip ahead or skip 
back among messages, save messages for future retrieval, or 
route the messages to a third party. Entering specific com- 
mands via a Touch-Tone telephone activates the system. 
Voice prompts from the system further assist the user.


Remote access to the system adds greater convenience, 
particularly for users on the road. Voice mail systems, like 
voice response, connect to a telephone system via commu- 
nications ports. Again, we recommend you coordinate the 
installation of a voice mail system with your phone system 
vendor. 
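
The mailbox operations described above map naturally onto a
small data structure. The following Python is a simplified
model under stated assumptions: the class and method names
are hypothetical, digitized speech is represented as a plain
byte string, and disk storage, editing, and Touch-Tone
decoding are left out.

```python
from collections import defaultdict


class VoiceMailSystem:
    """Toy store-and-forward model: one message list per mailbox."""

    def __init__(self):
        self.mailboxes = defaultdict(list)

    def send(self, sender, recipients, audio):
        """Route one digitized message to one or more mailboxes."""
        for recipient in recipients:
            self.mailboxes[recipient].append({"from": sender, "audio": audio})

    def list_messages(self, owner):
        """Return the senders of waiting messages, oldest first."""
        return [message["from"] for message in self.mailboxes[owner]]

    def play(self, owner, index):
        """Return the stored audio for playback."""
        return self.mailboxes[owner][index]["audio"]

    def forward(self, owner, index, third_party):
        """Route a received message on to a third party."""
        self.send(owner, [third_party], self.play(owner, index))
```

A single call to send with several recipients models the
"one or more voice mailboxes" routing, and forward reuses
send so that a passed-along message is stored like any
other.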


Voice store-and-forward systems, although a more recent 
development, already have several firms actively market- 
ing them. Pioneering this technology was ECS Telecom- 
munications, Inc. (now VMX, Inc.), which has since been 
joined by such firms as IBM, Wang, and several newer 
players like Voice Computer Technologies Corporation 
and Time and Space Processing, Inc. 


APPLICATIONS FOR SPEECH TECHNOLOGY 


The element of speech adds an entirely new dimension to 
the ways we interact with machines, particularly the com- 
puter. Probably the most obvious use of speech recognition 
technology is source data entry, which in itself can take 
many forms. Applications in use today include: 


• Package Sorting, where the sorter speaks a routing code
into the system, thus assuring the correct ultimate
destination.


• Quality Control and Inspection, where QC information is
entered by voice in a “hands-busy/eyes-busy” environment.


• Programming Machine Tools, in which a designer or
engineer verbally describes the parameters of a part to a
numerical control (NC) system, which then produces a
punched tape to be used in operating the machine tool.


• Handicapped Assists, particularly with voice-actuated
wheelchairs, so that mobility can be achieved without
hands or feet; also being used by individuals to control
various functions in their homes, affording them a fuller
life.


• Financial Transactions, which can greatly simplify
personal banking and electronic funds transfers.


• Security Systems, which can assure tighter controls due to
the uniqueness of individual “voice prints.”


• Military Applications, which include training air traffic
controllers, cockpit communications, and battlefield control
of weapons systems.


While speech recognition systems may appear to have a lot
of applications, they pale by comparison with speech
synthesis products. Most speech output products today are
built from IC chips or boards, which offer tremendous
flexibility of design. And since most speech chips on the
market are quite good (in terms of vocabulary and
intelligibility), the variety of “speaking” products is growing
daily. Some of the more familiar examples, and a few
unusual ones, include:


• Electronic games: Most of the newer arcade games offer
speech output, and the major home computer vendors
have speech modules available.


• Educational systems: TI’s Speak’N’Spell was a highly
successful use of a synthesizer; more systems are being
designed with speech capabilities.


• Handicapped users: Designed with a speech recognizer,
voice output affords the handicapped a virtual “human”
interface.


• Toys: In addition to Speak’N’Spell and computer
games, more manufacturers are including speech chips in
their products.


• Automobiles: Some autos today can tell you when you
need gas, or if your seatbelts are unfastened; this capability
will be available in more cars all the time.


• Elevators: Instead of you looking up, the elevator will
tell you where you are!


• Electronic Test Equipment: How about a voltmeter that
tells you the readings? There is already one on the market.


Most of these applications are just the tip of the iceberg. 
Speech components will find their way into more products 
in the coming years than you could ever imagine. 


CHART B. SPEECH SYNTHESIZER CONFIGURATION 


How about voice response and voice store-and-forward
systems? While applications for these products are perhaps
not as diverse as recognition/synthesis systems, there are 
several well-defined and field-proven uses, including: 


• Banking: Typical uses are for account status, credit
verification, bank-by-phone, file inquiries, and loan
processing.


• Distribution: Order entry, order status, and inventory
control are frequent uses.


• Manufacturing: Labor reporting and attendance, machine
usage and maintenance, material requisition and
control, and job status and costing information.


• Education: Computer-aided instruction (CAI) systems.


These are primarily voice response systems. Current uses 
for voice store-and-forward systems include routing 
memos, scheduling meetings, sending and receiving status 
reports, handling order input from local or remote sites, 
and broadcasting announcements. Voice store-and-for- 
ward systems were designed to improve individual produc- 
tivity by eliminating the inherent delays found in using the 
telephone, thereby providing more uninterrupted time for 
work. 


FUTURE DEVELOPMENTS IN SPEECH 


Research and development in speech technology is growing 
rapidly. While the major companies in the speech industry are
small- to medium-sized, computer industry leaders such as
Control Data, Hewlett-Packard, IBM, National Semiconductor,
NCR, Sperry, Texas Instruments, and Wang Laboratories
all have R&D programs of their own well under way.
Extensive work is also being done in universities and 
government agencies. Most of the work centers on producing
more powerful, less costly speech components with
larger vocabularies, better recognition capabilities, and 
more natural-sounding speech. 


The speech recognition industry is still searching for a truly 
speaker-independent continuous (conversational) speech 
system. Verbex claims that its Model 3000 Voice Data 
Entry Terminal may be trained to understand virtually any 
speaker, regardless of voice inflection or dialect, and can 
interpret continuous, naturally spoken phrases with ex- 
tremely high accuracy in environments having background 
noise as loud as 85 decibels. Most industry experts agree,
however, that a truly speaker-independent voice recognition
system (one that requires no “training”) is still years
away, unless a significant technological breakthrough occurs.
In the meantime, the speech recognition industry will
be improving its systems’ understanding capabilities,
ease of “training,” and vocabularies, while making them
more cost-effective and easier to use. Improvements in
semiconductor technology are expected to contribute to 
these goals. 


Some of the newer applications we can anticipate will
appear in the automated office (the typewriter you speak to
is not that far-fetched any more), where more data entry
and system command functions will be handled by voice.
Data terminal operation will be simplified with the use of
speech recognizers built into intelligent data terminals. In a
timesharing environment, the user will sign on to the
computer system by uttering a password. This may eliminate
the need for the user to go through a “training session”
to enroll voice patterns into the system. Much progress is
being made in this area; however, compared to input
devices such as keyboards, speech technology is very
expensive. New, programmable integrated circuits may
eventually help to truly integrate speech recognition and
data terminals.


Interstate Electronics’ VRC100-2 Voice Recognition Chip Set
consists of two chips used as building blocks for speech
recognition systems and capable of recognizing as many as
100 words or short phrases. This may be expanded to 200
words through the use of additional RAM.


Homes of the future will be under computer control—the 
homeowner takes care of household activities simply by 
“telling” the house what to do. With more and more 
automated functions in industry, greater use of voice input 
is expected to increase individual productivity. Military 
systems using voice technology will be more efficient, 
especially in terms of command and control operations. 
The military aircraft of the future will have many of its 
functions under voice control, leaving the pilot more free- 
dom to respond quickly to changing situations. 


Some other speech applications under development in- 
clude hands-free telephone systems which will allow the 
user to speak on the telephone without the need to touch it 
in any way. Automobiles may soon incorporate speech 
recognition systems that provide a capability for the driver 
to issue a command that will turn on the car’s lights. These 
systems, among others, will help handicapped persons to 
lead more productive lives. 


PLANNING CONSIDERATIONS 


If, after reading the report up to this point, you’ve decided
that a speech processing system of some type may fit into
your business or personal plans, we suggest you read further.
Although current speech applications are generally
well-defined, speech may or may not be for you.


CHART C. VOICE RESPONSE SYSTEM CONFIGURATION


Since source data entry is the most common use for speech 
recognizers, you may want to augment some segment of 
your data entry operations with a speech unit. Datapro 
suggests you consider the following: 


• Can you effectively replace a skilled data entry operator
with a speech recognition unit?


• The cost of a complete system could be several times that
of an intelligent CRT terminal.


• Examine the type of work performed. Can you enhance
some aspect of an employee’s activity by adding voice?


• Be sure the type of data to be entered can be handled by
the speech system. Remember, isolated-word
speaker-dependent systems cost the least, and have the highest
accuracy.


• How large a vocabulary do you need, and how much
accuracy does the nature of your business require?


• How critical is system response time to maintaining
productivity?


• Include some type of feedback mechanism (voice output,
of course) with your speech system to assure verification.


• Provide sufficient training for users, and make sure the
system is capable of handling an adequate range of voice
types.


• Have sufficient equipment to screen out external noise.


These specific areas should be carefully considered in 
addition to the normal financial analyses you would per- 
form for any capital expenditure. Recognition systems can 
range from a few hundred dollars for a board or chip to over 
$32,000 for the Verbex Model 3000 SPADS speech recognition
system. Datapro also encourages you to work closely
with vendors before buying, since they have a wealth of 
experience they can share with you. 


When examining the possibilities for adding speech output, 
consider the types of information you plan to convert to 
speech. As a test, see whether the information is perma- 
nent, temporary, or transient. If it’s permanent, it’s proba- 
bly better off on paper, since it may be used again. If it’s 
temporary, it probably needs to be accessed rapidly, in 
which case a system CRT is probably the best method. 
Transient information can be considered as the type typi- 
cally sent over the phone, such as weather reports, stock 
market quotes, financial data, and course grades. Since the 
information is voiced, by nature, it becomes a good candi- 
date for a voice output system. 
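
The permanent/temporary/transient test above can be written down as a small lookup. The following Python sketch is only an illustration of that rule of thumb; the names `Lifetime` and `suggest_medium` are hypothetical and do not come from any product discussed in this report.

```python
from enum import Enum


class Lifetime(Enum):
    """How long a piece of information stays useful (the three-way test above)."""
    PERMANENT = "permanent"
    TEMPORARY = "temporary"
    TRANSIENT = "transient"


# Suggested medium for each lifetime, following the rule of thumb in the text.
_MEDIUM = {
    Lifetime.PERMANENT: "paper",
    Lifetime.TEMPORARY: "CRT",
    Lifetime.TRANSIENT: "voice output",
}


def suggest_medium(lifetime: Lifetime) -> str:
    """Return the output medium the rule of thumb points to."""
    return _MEDIUM[lifetime]


if __name__ == "__main__":
    # Stock quotes read over the phone are transient information.
    print(suggest_medium(Lifetime.TRANSIENT))
```

A real evaluation would weigh more than one factor, so treat the lookup as a first screening step rather than a decision.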


Since current IC technology has brought down the average 
cost of speech chips to a few dollars, this capability is within 
the reach of more prospective users than ever before. We 
suggest you check carefully with your computer system 
vendor to make sure the system is capable of accommodat- 
ing speech components. Software may very likely represent 
a larger expenditure for you than the speech components. 


Voice response systems can represent a significant invest- 
ment, so make sure your company has a justifiable need for 
adding voice. Since these devices connect to your telephone 
system, you must figure out how much you expect the voice 
response system to be used, and then determine the opti- 
mum number of communicating links between systems to 
maximize the VR system’s availability. If the VR manufac- 
turer cannot perform this traffic analysis, your phone sys- 
tem vendor may do it for you. If you have an in-house 
telecommunications staff, one of them can probably conduct
the study. Some advance market research, or contacting other
users with similar situations, may help turn up other areas
for consideration. Pay particular
attention to the types of messages used, how they are 
spoken, who speaks them (male vs. female voice), and the 
methods for entering data to the system (Touch-Tone pad, 
keyboard, etc.). 
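
One common way to carry out such a traffic analysis is the Erlang B formula from telephone traffic engineering. This report does not say which method a vendor or telecommunications staff would use, so the Python sketch below is an assumption offered for illustration. The function names, the 1 percent blocking target, and the sample call volume are all hypothetical.

```python
def erlang_b(offered_erlangs: float, lines: int) -> float:
    """Probability that a call finds all `lines` busy (Erlang B formula).

    Uses the standard recurrence B(0) = 1, B(n) = A*B(n-1) / (n + A*B(n-1)),
    where A is the offered traffic in erlangs.
    """
    blocking = 1.0
    for n in range(1, lines + 1):
        blocking = (offered_erlangs * blocking) / (n + offered_erlangs * blocking)
    return blocking


def lines_needed(calls_per_hour: float, avg_call_minutes: float,
                 max_blocking: float = 0.01) -> int:
    """Smallest number of lines keeping blocking at or below `max_blocking`.

    `max_blocking` must be greater than zero, otherwise the search never ends.
    """
    # Offered traffic in erlangs = calls per hour x average holding time in hours.
    offered = calls_per_hour * avg_call_minutes / 60.0
    lines = 1
    while erlang_b(offered, lines) > max_blocking:
        lines += 1
    return lines


if __name__ == "__main__":
    # Hypothetical busy hour: 120 calls averaging 3 minutes each.
    print(lines_needed(calls_per_hour=120, avg_call_minutes=3))
```

Erlang B assumes that blocked callers give up rather than wait or redial at once. If your callers tend to retry, the estimate will be on the low side.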
American Microsystems’ EVK 3620 is a preassembled speech
evaluation board which includes a speech synthesizer, micro- 
computer, 128K-bit speech data ROM, audio amplifier and 
keyboard. The speech data ROM is programmed with over 100 
words and phrases in a female voice and words can be addressed 
and spoken individually or combined in sequence using the 
keyboard to form phrases or sentences. 


Voice store-and-forward systems, like voice response, are
usually big-ticket items. Advances in technology and packaging
will help provide lower costs and more variations in
system size. Again, you should conduct a thorough traffic 
analysis of the number of lines connecting the voice mail 
system to the phone system. You should expect a certain 
amount of reluctance from employees who will have to 
learn a new way of using the telephone. Effective salesman- 
ship early on, plus support from top management, should 
assure a smooth transition. Make sure you coordinate the 
installation of the new system with your telephone suppli- 
er, and you should have a relatively easy cutover. Since 
voice mail systems are supposed to reduce phone expenses 
by eliminating excess “telephone tag” phone calls, be pre- 
pared to closely document cost savings from your system, 
as they will ensure the longevity of the installation, and, of 
course, your job. 


IN CONCLUSION 


Datapro feels that we are on the verge of significant devel- 
opments in speech technology, and that the element of 
speech will find its way into our lives more and more all the 
time. Although there is still much to be learned, and more 
technological breakthroughs to be achieved, the industry is 
well-established and on the move. Soon, speaking to a 
computer may be as natural and easy as speaking to a 
friend. Projected improvements in worker productivity 
suggest a bright future for speech systems in industry. 
Speaking to an automated office system should enhance its 
overall value to a company. Home life as we know it will 
probably never be the same! 


THANK YOU 


Datapro takes this opportunity to thank all the vendors 
who graciously supplied us with data on their products. 
Their cooperation is greatly appreciated. 
THE COMPARISON CHARTS


The accompanying comparison charts summarize the char- 
acteristics of 83 speech products from 35 vendors. The 
charts have been divided into three distinct areas: voice 
recognition systems, voice synthesis systems, and voice 
response/store-and-forward systems. The type of system is 
indicated at the top of each page. We have identified 20 
voice recognition, 24 voice synthesis, and 35 voice re- 
sponse/store-and-forward products. The information in 
the charts was furnished by the vendors during January 
1984. The absence of any specific company from our charts 
means that the company either failed to respond to our 
requests for information or was unknown to us. The chart 
entries and their significance are explained in the following 
paragraphs. 


Voice Recognition 


We start off by identifying the communications interface, 
such as a serial RS-232-C or parallel connector. In many 
cases, the device described takes the form of a chip or 
printed circuit board, which can be used to build a com- 
plete system. If that’s the case, we’ll tell you at that time. 
We describe the processing facilities in the system, includ- 
ing the type of CPU (manufacturer and model), the amount 
and type (RAM, ROM, PROM, etc.) of main memory, and 
the product’s software characteristics. Frequently the soft- 
ware used is proprietary, and is usually built into the 
system as firmware. Datapro feels the key characteristics of
a voice recognition system include the recognition technique,
since some methods are more efficient than others, and the
maximum number of speakers and voice references stored, so
you can decide whether the product can support your needs.
If the system handles continuous speech, in
addition to isolated word recognition, we indicate this. 
Since the system’s performance is a key factor, we indicate 
the percent recognition accuracy and the average response 
time (from spoken word to system response). Any addition- 
al characteristics are also mentioned. We list some typical 
applications, as described by the manufacturer, and provide
complete details on purchase price, monthly rental,
maintenance cost, quantity discounts, speed of delivery
after receipt of order (ARO), training, service, and how many
are installed.
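
When comparing the percent recognition accuracy figures, keep in mind that a per-word figure compounds over a multi-word entry. The Python sketch below works through that arithmetic. It assumes each word is recognized independently of the others, which is a simplification, and the function names are hypothetical.

```python
def entry_success_probability(word_accuracy: float, words_per_entry: int) -> float:
    """Chance that every word in an entry is recognized correctly.

    Assumes recognition errors on different words are independent.
    """
    return word_accuracy ** words_per_entry


def expected_word_errors(word_accuracy: float, words_spoken: int) -> float:
    """Average number of misrecognized words out of `words_spoken`."""
    return (1.0 - word_accuracy) * words_spoken


if __name__ == "__main__":
    # A 99% per-word system used for ten-word entries.
    print(round(entry_success_probability(0.99, 10), 3))
    # Expected misrecognitions over a shift of 1,000 spoken words.
    print(round(expected_word_errors(0.99, 1000), 1))
```

Under these assumptions a system rated at 99 percent per word completes a ten-word entry without error only about nine times in ten, which is one reason a verification step matters.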


Voice Synthesis 


These charts are similar to voice recognition, except in the 
characteristics area. Here we detail the synthesis technique, 
the maximum vocabulary size, which can be in the form of 
words or total seconds of speech; support of foreign lan- 
guages; controls supported, such as pitch, tone, volume, and 
speech rate; and additional characteristics. 


Voice Response/Store-and-Forward 


We begin by identifying the type of system and its configu- 
ration, such as a standalone product or a component within 
a larger system. We tell you the maximum number of 
communications lines that connect the system to a counterpart,
such as a PABX, and the type of remote terminals
supported, such as a Touch-Tone phone or CRT. Details of
the processing facilities are similar to the previous charts,
and we’ve added the methods for interacting with the
system, such as tones, voice prompts, etc. Speech character- 
istics include the type of message generated, the size (in 
total words or total time), the message generation tech- 
nique, and the system’s call handling capacity, with the 
potential number of calls handled per hour. Following a list 
of applications, we finish with the same pricing, servicing, 
delivery, etc., details as in the previous charts. 
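
The calls-per-hour entries in the charts appear consistent with simple arithmetic: each line can carry 60 divided by the call length (in minutes) calls per hour. The Python sketch below reproduces that calculation. It assumes every line is busy back to back with no gaps between calls, so it gives an upper bound, and the function name is hypothetical.

```python
def calls_per_hour(lines: int, call_minutes: float) -> int:
    """Upper bound on calls handled per hour with every line fully occupied."""
    if call_minutes <= 0:
        raise ValueError("call_minutes must be positive")
    return int(lines * 60 / call_minutes)


if __name__ == "__main__":
    # 24 lines, as quoted for one of the charted voice response systems.
    print(calls_per_hour(24, 1), calls_per_hour(24, 3))
```

For 24 lines this gives 1,440 one-minute calls or 480 three-minute calls per hour, matching the figures one vendor supplied for a 24-line configuration.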


Suppliers 


Listed below, for your convenience, are the full names, 
addresses, and telephone numbers of 35 firms actively 
engaged in speech technology. Since some manufacturers 
cover more than one product area, at the end of each listing 
we list one or more numbers, in boldface, that correspond 
to the vendor’s area(s) of concentration. The numbers are 
(1) recognition, (2) synthesis, (3) response, and (4) store- 
and-forward. 


American Microsystems, Inc. 

3800 Homestead Road 

Santa Clara, CA 95051 

(408) 246-0330 x2091 2 


BBL Industries, Inc. 

2935 NE Parkway 

Atlanta, GA 30362 

(404) 449-7740 4 


Centigram Corporation 

1362 Borregas 

Sunnyvale, CA 94089 

(408) 734-3222 4 


Cognitronics Corporation 

25 Crescent Street 

Stamford, CT 06906 

(203) 327-5307 3 


APRIL 1984


Voice Computer Technologies’ 
VCT Series 2000 allows a Touch- 
Tone phone to act as an on-line 
order entry terminal and handles 
up to eight orders concurrently. It 
passes the orders on to a host com- 
puter, checks available stock and 
prints caller confirmations. 


Commterm, Inc.

900 Middlesex Turnpike Bldg. #5 

Billerica, MA 01821 

(617) 663-4442 4 


Computer Curriculum Corporation 

1070 Arastradero Road 

Palo Alto, CA 94304 

(415) 494-8450 3 


Digital Pathways, Inc. 

1060 E Meadow Circle 

Palo Alto, CA 94303 

(415) 493-5544 3 


Dragon Systems, Inc. 

173 Highland Street 

West Newton, MA 02165 

(617) 527-0372 1 


Engineered Systems, Inc. 

14775 Grover Street 

Omaha, NE 68144 

(402) 333-0100 3 


General Instrument

600 W John Street 

Hicksville, NY 11802 

(516) 733-3000 1,2 


Interface Technology 


10500 Kahlmeter Drive


St. Louis, MO 63132 
(314) 426-6880 3 


International Business Machines Corporation 

Old Orchard Road 

Armonk, NY 10504 

(Contact representative) 4 


Interstate Electronics Corp. 
1001 E. Ball Avenue, Box 3117 
Anaheim, CA 92803 

(714) 772-2811 
Maryland Computer Services, Inc.
2010 Rock Spring Road 
Forest Hill, MD 21050 


(301) 879-3366 2 


Master Specialties Company 
1640 Monrovia Avenue 
Costa Mesa, CA 92627 


(714) 642-2427 3 
Mimic Electronics Company 

P.O. Box 921 

Acton, MA 01720 

(617) 263-2101 1,2 


National Semiconductor Corp. 

21900 Semiconductor Drive 

Santa Clara, CA 95051 

(408) 721-5000 x5402 2 


NEC America, Inc. 

532 Broad Hollow Road

Melville, NY 11747 

(516) 694-7910 1,2 


Panasonic Industrial Company 

One Panasonic Way 

Secaucus, NJ 07094 

(201) 348-5270 2 


Perception Technology Corp. 

50 Shawmut Road 

Canton, MA 02021 

(617) 821-0320 3 


Periphonics Corporation 

4000 Veterans Memorial Highway 

Bohemia, NY 11716 

(516) 467-0500 3 


Scott Instruments 

1111 Willow Springs Drive 

Denton, TX 76205 

(817) 387-9514 1 


Speech Plus, Inc. 

461 N. Bernardo Avenue 

Mountain View, CA 94043 

(415) 964-7023 2,3 


Street Electronics Corp. 

1140 Mark Avenue 

Carpinteria, CA 93013

(805) 684-4593 2 


Texas Instruments, Inc. 
PO Box 2909 

Austin, TX 78769 
(512) 250-7363 


Threshold Technology 

1829 Underwood Boulevard 
Delran, NJ 08075 

(609) 461-9200 


Time and Space Processing, Inc. 


3410 Central Expressway 
Santa Clara, CA 95051 
(408) 730-0200 


Triformation Systems, Inc. 
3132 SE Jay Street 

Stuart, FL 33497 

(305) 283-4817 


Verbex Corporation


2 Oak Park 
Bedford, MA 01730 
(617) 275-5160 


VMX, Inc. 

1241 Columbia Drive 
Richardson, TX 75081 
(214) 699-1461 


Voice Computer Technologies 
1911 N. Fort Myer Drive 
Arlington, VA 22209 

(703) 522-5706 


Votan 

4487 Technology Drive 
Fremont, CA 94538 
(415) 490-7600 


Votrax Div. of Federal Screw Works 


1394 Rankin 
Troy, MI 48083 
(313) 588-2050 


Vynet Corporation 
160B Albright Way 
Los Gatos, CA 95030 
(408) 370-0555 


Wang Laboratories 
1 Industrial Avenue 
Lowell, MA 01851 
(617) 459-5000 


© 1984 DATAPRO RESEARCH CORPORATION, DELRAN, NJ 08075 USA 
REPRODUCTION PROHIBITED 


3,4 


1,2 


3,4 


2,3 


3,4 


40 




MANUFACTURER AND MODEL

COMMUNICATIONS INTERFACE

PROCESSING FACILITIES
Type
Main memory
Software

SPEECH CHARACTERISTICS
Recognition technique(s)
Maximum number of speakers supported
Maximum number of voice references stored
Handle continuous speech?
Percent recognition accuracy
Average response time
Additional characteristics

TYPICAL APPLICATIONS

PRICING, AVAILABILITY, AND SERVICE
Purchase price
Monthly rental
Monthly maintenance
Quantity discounts
Product delivery ARO
Training provided by
Service provided by
Number installed


Dragon Systems
MARK 


Customized to OEM’s 
system 

6502; 8086 

16K ROM, 8K RAM 
Proprietary 
Proprietary stochastic 
processing 


Unlimited 


Unlimited 


Voice response available 


Data entry/retrieval 


OEM license under $300 


Contact vendor 
Contact vendor 
Contact vendor 


Interstate Electronics
VRT 300 


RS-232-C 


Motorola 68B02
RAM-20KB, EPROM 12K 
Firmware 


Pattern generator; 
pattern matching 


Speaker dependent board 


Industrial, mechanical, 
communications 


Contact vendor 


Interstate 
Over 1,000 


General Instrument 
GI SP1000 


Proprietary 
LPC parameter extractor 


Unlimited 


Yes 
6-8 weeks 


NEC 
DP-200 CSR 


RS-232-C, RS-422, 
GPIB; parallel 


Audio response 


Baggage sorting, glass 
inspection, handicapped 
aid, meat inspection 


$12,000 
Contact vendor 


Interstate Electronics
VRC 008 


Voice pattern waveform 
anal., template match. 


Speaker independent chip 


Toys and games 


Contact vendor 


Yes 

30 days 
Interstate
Interstate
Over 20,000 


RS-232-C 


2-8000 
ROM, 32K RAM 
Firmware 


Dynamic programming 
1 


120 words 


Hospital data entry, 
pharmacist inventory, 
chem. analysis inspect. 


$2,000 


Immediate 
NEC, OEM 
NEC 
300 
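Interstate Electronics VRC 100-2

COMMUNICATIONS INTERFACE: Parallel; RS-232-C

PROCESSING FACILITIES
Type: Motorola 68B03
Main memory: EPROM-16K bytes
Software: Firmware

SPEECH CHARACTERISTICS
Recognition technique(s): Pattern generator; pattern matching
Maximum number of speakers supported: 1
Maximum number of voice references stored: 200
Handle continuous speech? No
Percent recognition accuracy: 99%
Average response time: 150 ms
Additional characteristics: Speaker dependent chip set

TYPICAL APPLICATIONS: Industrial, mechanical, communications, consumer

PRICING, AVAILABILITY, AND SERVICE
Purchase price: Contact vendor
Training provided by: Interstate
Service provided by: Interstate
Number installed: 5,000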


Scott Instruments 
Shadow/VET 


Host-dependent 


Apple II+, Franklin
None 

Proprietary 
Words, phrases 


1+ 


40-word subsets 


Manufacturer 
Scott Instruments
VET-232 


Scott Instruments 
VET-2 


Host-dependent RS-232-C 


Apple II+, IIe, Franklin
10.6K 
Proprietary 


None 
Proprietary 


Words, phrases Words, phrases 
1+ 


40-word subsets 


Manufacturer 


Threshold Technology 
600/680 


Threshold Technology 
Auricle-1/PCB 


RS-232-C; 20 Ma. current |RS-232-C; IEEE 488 


Firmware 


Spectral analysis; 
isolated word 
Unlimited 


Spectral analysis; 
isolated word 


80 words or phrases 


Less than 1 sec. 

High background noise Office noise environment 
immunity 

Quality control; inven. 
control; sortation; 
inspection; data entry 


Data entry; equipment 


for OEMs 


$14,000-$15,200
Not applicable 
Not applicable 
Yes 
60 days 
Not applicable 
Threshold 
100 (-1); 50 (PCB) 


test; development system 


$1,495/Request for quote 


Texas Instruments
Speech Command 
System 


Phone, TIPC Bus 


TMS 320, TMS 7000 
32K Static RAM 


Speaker-dependent 


50/vocabulary


Yes (connected speech) 


Transparent keyboard 


Office/knowledge worker, 


factory automation 


Contact TI Dealer 
Contact vendor 
TI or Dealer
1000 


Threshold Technology 
Auricle-HP 


Office noise environment 


Data entry; equipment 


test; development system 
Threshold Technology
500/580 


RS-232-C; 20 Ma current 


Spectral analysis; 
isolated word 


Less than 1 sec. 

High background noise 
immunity; 

Quality control; inven. 


control; sortation; 
inspection; data entry 


Threshold Technology 
T950 Voice 
Option Board 


Internal (via Televideo 


Office noise environment 


Heavy volume data entry 
Verbex
Model 3000/SPADS 


Votan 
VX Series Boards 


RS-232-C, 20 mA current |RS-232-C multibus 


loop 


8086 control processor 
.5-1 megabyte 
Proprietary 


Proprietary 


10-100 floppy (3000) 
100 Winchester (SPADS) 


Maintains accuracy 
in 85db noise 
Unlimited duration 
utterances 


$17,900-$32,000 
Base available 


TMS 320, Intel 8086 
128-256K RAM 
Proprietary 


Digital transform; 
dynamic programming 
Unlimited 


255 on board 


.3 second 
SDR, STR & voice output 
Data entry/retrieval; 


remote phone proc.; 
equipment inst. control 


Approximately $6,000 
Contact vendor 


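Votan Voice Management System

COMMUNICATIONS INTERFACE: RS-232-C multibus

PROCESSING FACILITIES
Type: TMS 320, Intel 8086
Main memory: +512K RAM
Software: Proprietary

SPEECH CHARACTERISTICS
Recognition technique(s): Digital transform; dynamic programming
Maximum number of speakers supported: Unlimited
Maximum number of voice references stored: 255 on board
Handle continuous speech? Yes
Percent recognition accuracy: 99.97%
Average response time: .3 second
Additional characteristics: SDR, STR & voice output

TYPICAL APPLICATIONS: Data entry/retrieval; remote phone proc.; equipment inst. control

PRICING, AVAILABILITY, AND SERVICE
Purchase price: $28,000 (4 channels)
Other terms: Contact vendor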
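Votan Voice Terminal

COMMUNICATIONS INTERFACE: RS-232-C

PROCESSING FACILITIES
Type: TMS 320, Intel 8086
Main memory: 64-256K RAM
Software: Proprietary

SPEECH CHARACTERISTICS
Recognition technique(s): Digital transform; dynamic programming
Maximum number of speakers supported: Unlimited
Maximum number of voice references stored: 150 on board
Handle continuous speech? Yes
Percent recognition accuracy: 99.97%
Average response time: .3 second
Additional characteristics: SDR, STR & voice output

TYPICAL APPLICATIONS: Data entry/retrieval; remote phone proc.

PRICING, AVAILABILITY, AND SERVICE
Purchase price: $3,000-$4,000
Other terms: Contact vendor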
American Microsystems, Inc.
S3610/3620


20K ROM for 3610 
Firmware 


LPC 


Vocabulary size, maximum words/Unlimited (3620) 


Foreign languages supported 


Controls supported 


Additional characteristics 


_ | TYPICAL APPLICATIONS 


PRICING, AVAILABILITY, AND 


SERVICE 
Purchase price 


Monthly rental 
Monthly maintenance 
Quantity discounts 
Product delivery ARO 
Training provided by 
Service provided by 
Number installed 


MANUFACTURER 


AND MODEL 


French, German, Swedish 


Male and female speech 


Natural voice (male or 
female) 

Any application needing 
voice output 


Contact vendor 


Contact vendor 
Data sheets 
Speech Plus, Inc. 


Maryland Comp. Services 


Information Through 
Speech/Total Talk II 


COMMUNICATIONS INTERFACE |RS-232-C 


PROCESSING FACILITIES 
Type 


Main memory 


Software 


SPEECH CHARACTERISTICS 


Synthesis technique(s) 


HP125 using Z-80A 
64K RAM 
Firmware,disk(Info only) 


Phoneme synthesis, 
using Votrax VSM 


Vocabulary size, maximum words/Unlimited 


Foreign languages supported 


Controls supported 


Additional characteristics 


TYPICAL APPLICATIONS 


PRICING, AVAILABILITY, AND 


SERVICE 
Purchase price 


Monthly rental 
Monthly maintenance 
Quantity discounts 
Product delivery ARO 
Training provided by 
Service provided by 
Number installed 


None 
Volume, pitch, tone, 


Visually impaired 


$7,995 to $12,995 (Info) 


$5,995 (Total Talk II)


General Instrument 
SP-0256 


Not applicable 


Not applicable 
2K 
Proprietary 
LPC 

491 words 


Unlimited 
All 


Automotive, consumer, 
computer 


$400 at 10K quantity 


Not applicable 
Not applicable 


Not applicable 
Not applicable 


Maryland Computer 
Services 
Total Talk 


RS-232-C 


HP2621A with dual Z-80s 


ROM 
Firmware 


Phoneme synthesis using 


Volume, pitch, tone, 
rate 


Visually impaired 


$3,500 


$500 (extended warranty) 


General Instrument 
VSM2128 


Plug-in board 


GI PIC Series 

16K-128K ROM 
Firmware 

Phoneme synthesis, LPC, 
formant synthesis 
Unlimited 

None 

Volume, pitch, tone 


Uses GI SP-0256 chip 


Automobiles, education, 


security systems, games, 


visually impaired 


Mimic, Inc. 
Mimic Speech 
Processor 


TTL parallel port 


Apple II; TRS-80 I, II,
III; S-100 & std. bus
Host RAM 


Mimtalk 


Time Domain 

U.S. Patent 4,271,332 
Unlimited 

All 

Sampling rate 


Can also be used for 
voice recognition 
Alarms, games, aid for 
handicapped; obtain data 


Immediate 
User manual — 
Factory 
General Instrument
SP-0250 


Not applicable 


Not applicable 


Unlimited 
Unlimited 
All 


Consumer, computer 


$292 at 10K quantity 


Not applicable 
Not applicable 
Yes 

6-8 weeks 
Not applicable 
Not applicable 


Mimic, Inc. 
V.O.I.S.


Switch Closures; TTL 
parallel port 


Apple Il to obtain 
speech data 
128K, 2764 EPROM 


VOIS support software 


Time Domain 

U.S. Patent 4,271,332 
128 words, typical 

All 

Sampling rate 


Selectable sirens and 
alerting tones 
Industrial alarms, 
vehicle and aircraft 
warning systems 


Contact vendor 


Yes 

Contact vendor 
User manual 
Factory 


All About Speech Technology 
SPEECH SYNTHESIS SYSTEMS 


National 
Semiconductor 
MM 54104 


Integrated circuit 


Firmware 


Mozer time domain ADPCM 


100-200 Unlimited 
Any Unlimited 
— Sampling cycle volume 


Natural male/female 
2000-4000 bits/sec 
Any voice output 


Speech recognition 


Contact vendor 


Yes 

Contact vendor 
National Semiconductor 
National Semiconductor 


Street Electronics 
ECHO II


Speech Plus 
1000A/1020A 


RS-232-C;multibus;para- 
llel;peripheral (1020A) 


Intel 8085 Apple 6502


108K EPROM 8K 


Firmware Proprietary 


2200 bps, 12-pole LPC |LPC

Unlimited 

Not applicable 

Rate, pronunciation, 
punctuation, pitch 
Speak a string of cap- 
ital letters 

Aid to the sight- 
impaired; education; 
industrial, games 


Pitch, rate, volume 
Male or female voices 


Alarm, test equip., 
public address, 
automation 


$1,200-$2,500 $149.95 


Yes 

Next day to 2 weeks 
Street Electronics Corp. 
One 


Announcements, warnings 
audio response systems 


Plug-in card; Apple buss 


Panasonic 
MN6401 


IC chip 


NEC MN6401 
32K ROM 


Firmware 


Partial Auto Correlation 


Unlimited
Speech rate 
IC chip — 


Consumer products, 
games, education 


Contact vendor 


Street Electronics 
ECHO GP/PC 


Rate, pronunciation, 
punctuation, pitch 


impaired; education; 
industrial, games 


Street Electronics Corp. 


One 
Speech Plus
PROSE 2000 


RS-232-C; multibus 


Intel 8086
96K EPROM 
Firmware 


Text-to-speech; formant 
synthesizer 

Unlimited 

None 

Pitch, rate, volume, 
synchronization 
Linguistic controls; 

male voice 

Training, test equip., 
robotics, telecom alarm 


Texas Instruments 
Speech Command 
System 


Phone, TIPC Bus 


TMS 320, TMS 7000 


32K Static RAM 


LPC-10, 2400bps 


16 min/320K floppy 
Any 

Rate, volume, software 
controlled 

Transparent keyboard 


Office/knowledge worker, 
factory automation 


Yes 

Contact TI Dealer 
TI Dealer 

TI or Dealer 
1000 
Triformation


Systems Free Scan 
Speech Terminal 


MANUFACTURER 
AND MODEL 


Verbex 
Model 3000/SPADS 


Triformation Systems 
AVC-1 


Triformation Systems 
VIP System 


COMMUNICATIONS INTERFACE |RS-232-C RS-232-C, 20ma current 
loop 
PROCESSING FACILITIES 

Type —_ 8086 control processor 
2-4 2901-based sp. proc. 
.5-1 megabyte 


Main memory ROM 2K RAM 


Software Firmware Firmware 
SPEECH CHARACTERISTICS 


Synthesis technique(s) 


Proprietary 


Text-to-speech, 
phoneme synthesis 
Vocabulary size, maximum words/Unlimited 60 

Foreign languages supported —_ Any 

Controls supported Volume, pitch, tone Volume, tone Volume, tone No restrictions 


8 KH, Recording 


Additional characteristics Speaks entire words Speaks entire words or {Speaks entire words or 
or spells out words spells out words spells out words 
Visually handicapped Visually handicapped Visually handicapped 


users users users 


High quality natural 
speech 

Unlimited duration 
utterances 


TYPICAL APPLICATIONS 


PRICING, AVAILABILITY, AND 
SERVICE 

Purchase price $17,900-$32,000 
Monthly rental Base available 
Monthly maintenance 
Quantity discounts 
Product delivery ARO 
Training provided by 
Service provided by 
Number installed 


MANUFACTURER 
AND MODEL 


Personal 
Speech System 


Votrax 


SPEECH PAC Type-'N-Talk 


COMMUNICATIONS INTERFACE Parallel RS-232-C; parallel RS-232-C; parallel port 


PROCESSING FACILITIES 
Type Votrax SC-01 Votrax SC-01 (refined) 


Main memory 32K EPROM 1K RAM, 8K ROM —_ 


Software 


SPEECH CHARACTERISTICS 


Synthesis technique(s) 


Firmware 


Phoneme synthesis 


Firmware 


Phoneme synthesis 


Firmware 
Phoneme synthesis 


Unlimited 


Firmware 
Phoneme synthesis 


Unlimited 


Vocabulary size, maximum words}Unlimited 


Foreign languages supported 
Controls supported 


Additional characteristics 


TYPICAL APPLICATIONS 


PRICING, AVAILABILITY, AND 


SERVICE 
Purchase price 


Monthly rental 
Monthly maintenance 
Quantity discounts 
Product delivery ARO 
Training provided by 
Service provided by 
Number installed 


None 
Pitch, volume, tone 


Complete board-level 
system 

Alarm systems, eleva- 
tors, education, toys 


Contact vendor 


Pitch, volume, tone 


Complete microcomputer 
system 

Industrial control, 
games, sound effects 


Contact vendor 


None 
Pitch, volume, tone 


750-character buffer 
for typed characters 
Computer prompting, 
computer-aided 
instruction 


$345 


User manual 
Factory 
None
Pitch, volume, tone 


3500-character buffer; 
sim. speech/sound 
Computer prompting, 
computer-aided 
instruction 


$395 


User manual 
Factory 




MANUFACTURER 
AND MODEL 
Type 
CONFIGURATION 


COMMUNICATIONS INPUT 
Number of lines 


Remote terminals supported 
PROCESSING FACILITIES 

Type 

Software 


Interaction 


SPEECH CHARACTERISTICS 
Type 


Size 

Generation technique 

Message capacity, calls per hr. 
1-minute calls 


3-minute calls 


TYPICAL APPLICATIONS 


PRICING, AVAILABILITY, AND 
SERVICE 
Purchase price 
Monthly rental 
Monthly maintenance 
Quantity discounts 
Product delivery ARO 
Training provided by 


Service provided by 


Number installed 




All About Speech Technology 
VOICE RESPONSE/STORE-AND-FORWARD SYSTEMS 


BBL Industries 
Voice Retrieval System 


Voice store-and- 
forward 

Freestanding or integ- 
rated with radio paging 
& mobile telephone 


2-64 


8 local or remote 
RS-232 interface ports 


Message center; radio 
paging; mobile telephone 


$70,000 up 


Time & material 


Centigram 
VoiceMemo 
Voice store-and-forward 


Freestanding systems; 
connect to PBX or Key 


Proprietary 


Voice prompts 


Words, phrases 
Unlimited 


Digitized speech 


Office communications; 
telephone answering; 
lodging 


Contact distributor 


Contact distributor 
30 days 

Contact distributor 
Contact distributor 


5 


Cognitronics 
Model 681 
Speechmaker 


Voice response 


Front-end 


Up to 3 


Output only 


Words, phrases 
64 words/message 


Data modulation 


Alarm announcements, 
security systems, 


Contact vendor 
Yes 

30 days 
Cognitronics 


Cognitronics 
70G-700-01q
Communications 


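Cognitronics Model 682 Speechmaker

Type: Voice response
CONFIGURATION: Subsystem under host control

COMMUNICATIONS INPUT
Number of lines: Up to 48; increments of 4; FCC approved
Remote terminals supported: Touch-tone input devices, transaction phone, ATMs, POS, PC

PROCESSING FACILITIES
Type: Motorola 6800
Software: Proprietary
Interaction: Voice, tone, ASCII

SPEECH CHARACTERISTICS
Type: Words, phrases
Size: Unlimited
Generation technique: Delta modulation
Message capacity, calls per hr.
1-minute calls: 1,440 (24 lines)
3-minute calls: 480 (24 lines)

TYPICAL APPLICATIONS: Banking, order entry, credit authorization

PRICING, AVAILABILITY, AND SERVICE
Purchase price: $16,500
Monthly rental: Contact vendor
Monthly maintenance: Contact vendor
Quantity discounts: Yes
Product delivery ARO: 60 days
Training provided by: Cognitronics
Service provided by: Cognitronics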




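Cognitronics Model 683 Speechmaker

Type: Voice response
CONFIGURATION: Subsystem under host control

COMMUNICATIONS INPUT
Number of lines: Up to 12; single increments; FCC approved
Remote terminals supported: Touch-tone input devices, transaction phone, ATMs, POS, PC

PROCESSING FACILITIES
Type: Motorola 6800
Software: Proprietary
Interaction: Voice, tone, ASCII

SPEECH CHARACTERISTICS
Type: Words, phrases
Size: Unlimited
Generation technique: Delta modulation
Message capacity, calls per hr.
1-minute calls: 360 (6 lines)
3-minute calls: 120 (6 lines)

TYPICAL APPLICATIONS: Banking, order entry, credit authorization

PRICING, AVAILABILITY, AND SERVICE
Purchase price: $6,800
Monthly rental: Contact vendor
Monthly maintenance: Contact vendor
Quantity discounts: Yes
Product delivery ARO: 30 days
Training provided by: Cognitronics
Service provided by: Cognitronics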


Cognitronics 
VRCS 


Voice response 


Freestanding system; 


on-line interaction; 
RJE interactive 


Up to 12 (Option) 
Up to 48 (Option) 


Touch-tone input 
devices, transaction 


phone, ATMs, POS, PC 


Proprietary 


Voice, tone, ASCII


Words, phrases 
Unlimited 


Delta modulation 


360-1,440 
120-480 


Banking, order entry, 
credit authorization 


Cognitronics 


Cognitronics 


Commterm 
EVX 1000/EVXTRA 


Voice store-and- 
forward 
Freestanding 


PBX programming (in- 
tegrated system) 
PBX programming (in- 
tegrated system) 
PBX programming (in- 
tegrated system) 


Phrases, words 


Digital 


Up to 7 minutes 
Individually programmed 
via user needs 

RCC, TAS, cellular, PBX 


Commterm 
Commterm 


Over 70 
Computer Curriculum
Corp. Digital Speech 
System (DSS) 


Voice response 


Freestanding; 16-term- 
inal cluster subsystems 


1-5 


Alphanumeric terminals 


Proprietary 


Prompts via host 


Words, phrases 
Unlimited 


LPC 


360-720 
120-240 


Computer-aided 
instruction 


$9,000 to $10,000 




Computer Curriculum 
Corp. Telephone Access 
Unit (TAU) 

Voice response 
Freestanding; 6- to 


12-terminal cluster 
subsystem 


1-5 


Alphanumeric terminals, 
touch-tone input 


Proprietary 


Prompts via host 


Words, phrases 
Unlimited 


LPC 


360-720 
120-240 


Computer-aided 
instruction 


$5,000 to $7,500 


Digital Pathways 
SLC-lI 


Voice response 


Freestanding 


4 RS-232-C, 1 telephone 


Touch-tone input, 
ASCII modems 


Proprietary 6502-based 


Unlimited 


Linear predictive 




Digital Pathways 
DEFENDER II
Voice response 


Freestanding; front-end 
processor 


Incoming 4 to 12 
Outgoing 4 to 384 


Touch-tone 


Proprietary 6502-based 
Firmware 


Prompting 


Phrases 


12-15 


4-12 concurrent 
4-12 concurrent 


Computer access; 
management/security 


60 days 

Digital Pathways 
Digital Pathways 
30 


Engineered 
Systems, Inc. 
TCS/500 Series 


Voice response 


Front end or
freestanding


1-72; increments of 1 


Touch-tone POS, personal 
computers 

TI 990 Series

Proprietary 


Voice prompt; ASCII


Words, phrases, text 
Unlimited 


LPC 


60-4,320 
20-1,440 


Banking, data entry, 
credit authorization, 
order entry, customer 
service 

$20,000 up 

Contact vendor 


Contact vendor 


Yes 


70G-700-01t
Communications 




Interface 
Technology, Inc. 
TOES 


Voice response 


Freestanding; front-end 


1-32 


Touch-tone pads, 

CRTs, PC, POS terminals 
IBM Series/1

Total Entry System 


Prompts, verifies, 
validates 


Continuously variable- 
slope delta modulation 


4,320 (max.) 
1,440 (max.) 
Order entry, account
inquiry, meter reporting,
payroll reporting,


Interface Technology 


Interface Technology and 


International Business 
Machines Series/1 Audio 


Distribution System 


Voice store-and-forward 


Freestanding system; 
connect to PABX 


2-10 


Touch-tone input 


IBM Series/1 
IBM Licensed Program 


Voice HELP facility 


Sentences 


Digitized voice 


Up to 1,630/day 


Sales/mktg., banking, 
remote entry 


$12,700 basic license 


$350 basic license 


Contact vendor 
IBM 


IBM 


Interstate
Electronics 
VTM 150 


Voice response 


Component within 
computer system 


Up to 28 serial, 
parallel, or Multibus 


Alphanumeric terminals 


Proprietary 
Proprietary in EPROM 


12 commands 


Words, phrases 
Up to 1,500-word
vocabulary
Phoneme synthesis 


Data entry 


Contact vendor 
Contact vendor 
Contact vendor 
Contact vendor 
Contact vendor 
Interstate 


Interstate 
Master Specialties Co.
Model 1651 
Voice response 


Subsystem under host 
control 


Z-80, others 
Proprietary in ROM, 
PROM 


Words, phrases 
Up to 256 sec./msg. 
Digitized voice,
delta modulation


Aircraft warning
systems, telecommunications,
paging, credit
verification


$2,000 up 




Perception 
Technology Corp. 
BT-II


Voice response 


Freestanding front-end 
processor-emulates 
terminal interface 


4-32; 4-line increments 
asynchronous 


Data entry via touch 
tone telephone; POS 
terminals 


Digital Equipment 
Corp. LSI-11 
Proprietary 


Prompt., repeat input, 
invalid entry 


Words, phrases-48 sec. 


to 16% hours 
Unlimited 


Adaptive delta
modulation

240-1,920 

80-640 

Cash balance reporting,
payroll & order entry,
credit authorization


$16,610 up 
Contact vendor 
Contact vendor 
Yes 

30 days 


PTC 


PTC/Digital Equip. Corp. 


Over 300 


Perception 
Technology Corp. 
VOCOM I


Voice response 


Freestanding front-end 
processor-emulates 
terminal or terminal 
controller 


4-32; 4-line increments 
asynch./bisync.


Data entry via touch 
tone telephone 


Digital Equipment Corp. 
LSI-11 
Proprietary 


Prompt., repeat input, 
invalid entry 


Words, phrases-48 sec. 


to 16% hours 
Unlimited 


Adaptive delta
modulation

240-1,920 

80-640 

Cash balance reporting,
payroll & order entry,
credit authorization


$49,500 up 
Contact vendor 
Contact vendor 
Yes 

60 days 


PTC 


PTC/Digital Equip. Corp. 


Over 130 


Perception 
Technology Corp. 
VOCOM II 


Voice response 


Standalone data base 


4-256; 4-line increments 


Data entry via touch 
tone telephone 


Digital Equipment Corp. 
PDP-11, LSI-11 


Prompt., repeat input, 
invalid entry 


Words, phrases-48 sec. 
to 16% hours 
Unlimited 


Adaptive delta
modulation


192-1,536 
64-512 


Cash balance reporting, 
payroll & order entry, 
credit authorization 


$60,000 up 


Contact vendor 

Contact vendor 

Yes 

90 days 

PTC/Digital Equip. Corp. 
PTC/Digital Equip. Corp. 


Over 30 


Periphonics Corporation 
VoicePac/T-Comm 


Voice response 
Freestanding subsystem
and component within
larger system


78 (VoicePac) 
400 (T-Comm) 


Touch-tone and various 
data terminals 


DEC (T-Comm), PDP-11 
Series 
Proprietary 


Yes 


Words, phrases 
Variable 


Digitized human voice 


500/16, 1500/38 lines 
500/38 lines, 1500/100 
lines 

Order entry, credit 
authorization, bank from 
home 


$25,000 (VoicePac) 
$50,000 (T-Comm) 
Variable 

Approx. $400 

Yes 

90 days 
Periphonics 


Periphonics 


200 (VoicePac) 
500 (T-Comm) 




Periphonics Corporation
Telemarketer


Voice response 
Freestanding subsystem
and component within
larger system


20 


Touch-tone and various 
data terminals 


Multiple 32 bit and 16 
bit 

Proprietary and UNIX 
based 

Yes 

Words, phrases 


Variable 


Digitized human voice 


500/16, 1500/38 lines 
500/38 lines, 1500/100 
lines 

Order entry 


Approx. $600 
Yes 

90 days 
Periphonics 


Periphonics 


Periphonics Corporation
VoiceStar/CommStar


Voice response 


Freestanding subsystem 
and component within 
larger system 


20 or more 


TT and dial up; data 
terminals


Dual 32 bit and 16 bit, 
PDP-11 and M 68000 
Proprietary and UNIX 
based 

Yes 


Words, phrases 
Variable 


Digitized human voice 


500/16, 1500/38 lines 
500/38 lines, 1500/100 
lines 

Order entry, credit 
authorization, bank from 
home 


$50,000 (VoiceStar) 
$75,000 (CommStar) 
Variable 


$300-$400 


Yes 


120 days 
Periphonics 
Periphonics 


Periphonics Corporation
VoiceBox


Voice response 


Component in larger 
system 


32 


TT and dial up; data 
terminals 

PDP-11 
Proprietary/firmware 


Yes 


Words, phrases 
Up to 256 seconds 


Digitized human voice 


500/16, 1500/38 lines 
500/38 lines, 1500/100 
lines 

Order entry, credit 
authorization, bank from 
home 


Under $20,000 


Variable 
Approx. $200 
Yes 

60 days 
Periphonics 
Periphonics 
Speech Plus
CallText 5000 


Voice response 


IBM PC compatible card 


1 (can dial out) 


Touch-tone pad 


Firmware 


Voice Prompts 


Text-to-speech 
Unlimited 


Proprietary 


1,440 per line 
480 per line 


All remote inquiry 


30 days 


Speech Plus 




SEPTEMBER 1984 




Speech Plus 
CallText 5050


Voice response 


Computer peripheral on
RS-232-C port


1-6 (can dial out) 


Touch-tone pad 


Firmware 


Voice Prompts 


Text-to-speech 
Unlimited 


Proprietary 


1,440 per line 
480 per line 


All remote inquiry 


$3,000-$17,000


30 days 


Speech Plus 


Speech Plus 
CallText 5100 


Voice response 


Freestanding subsystem; 
connect to computer 


1-6 (can dial out) 


Touch-tone pad 


Intel 8088 
Basic, Assembler 


Voice Prompts 


Text-to-speech 
Unlimited 


Proprietary 


1,440 per line 
480 per line 


All remote inquiry,
sales, service,
transactions, mail


$10,000-$25,000 


60 days 
Speech Plus 


Speech Plus


Texas Instruments 
Speech Command 
System 


Voice response 


Integrated option for 
TI Professional Computer


1 phone line & desk 
phone 


Phone (touch tone) & 
phone line 


TMS 320, TMS 7000 


Downloadable speech 
algorithms 

Noise prompting for 
remote message review 


Analysis/synthesis 
16 min/floppy or 8 hour
Winchester disk
LPC 


99 stored phone messages 
99 stored phone messages 


Office/knowledge worker,
factory automation 


Contact TI Dealer
Dealer
TI or Dealer


1000 


© 1984 DATAPRO RESEARCH CORPORATION, DELRAN, NJ 08075 USA 


REPRODUCTION PROHIBITED 




Time and Space 
Processing 
Series 1000 


Voice digitizer; voice 
response; store & forw. 
Freestanding or 
component within larger 
system 


1 (connect to PBX) 


Touch-tone input 


Proprietary 


Words, phrases, 
continuous speech 
Unlimited 


Digitized voice 


Real time voice
transmission at 2400 bps;
store & forward; voice 
response & recognition 


$12,500 


60 days 


Time and Space 
Processing 
Time and Space 
Processing 




Time and Space 
Processing 
Series 2400 


Voice privacy terminal 


Freestanding 


Touch-tone input 


Proprietary 


Words, phrases, 
continuous speech 
Unlimited 


Digitized voice 


Voice privacy 


90 days 


Time and Space 
Processing 
Time and Space 
Processing 


VMX Incorporated 


VMX/16 


Voice store-and- 
forward 
Freestanding system; 
connect to PABX 


16 


Touch-tone input 


Intel 
Proprietary 


Voice prompts 


Sentences 
Max. 10-min. msg. 


Digitized voice 


3,140/day 


Sales/mktg., banking,
field service


VMX Incorporated
VMX/64 


Voice store-and- 
forward 

Freestanding system; 
connect to PABX 


64 


Touch-tone input 


Intel 
Proprietary 


Voice prompts 


Sentences 
Max. 10-min. msg. 


Digitized voice 


17,190/day 


Sales/mktg., banking, 
field service 
Voice Computer
Technologies 
Series 2000 


Voice response; voice 
store-and-forward 
Freestanding or used as 
a VR front end 


Max 8 phone lines 
Min 2 phone lines 


Touch-tone pads for 
both alpha and digits 


Voice prompts 


Words, phrases, 
continuous speech 
Unlimited 


Digitized voice 


Order entry; data entry;
information
dissemination; voice mail;
college registration


$27,500-$49,000 






Voice Computer
Technologies
Series 2000
Voice response; voice
store-and-forward 
Freestanding or used as 
a VR front end 


Max 8 phone lines 
Min 2 phone lines 
4-10 RS-232-C ports 
Touch-tone pads for 
both alpha and digits 


8086 based boards 
Proprietary 


Proprietary-multitasking 


Phrases and/or complete 
sentences 
Limited to disk 


Order entry; data entry; 
information
dissemination; voice mail


$27,500-$49,000 


Digital Pathways 




Votan 
VX Series Boards 


Voice response; store- 
and-forward 
Boards 


1 voice channel 


Voice transmitted over 
phone; touch tone 


TMS 320, custom VLSI, 
Intel 8086 
Proprietary 


Voice, prompting 


Natural sentences 
Limited by storage only 


Digitized voice 


Limited by storage only 
Limited by storage only 


Phone-based
transactions, voice mail,
data retrieval 


Approximately $6,000 


Contact vendor 


Votan Voice 
Management System 


Voice response; store- 
and-forward 
Multi-channel voice 
system 


1 line per voice channel 
(4-8 channels) 


Voice transmitted over
phone; touch tone


TMS 320, custom VLSI, 


Voice, prompting 


Natural sentences 
Limited by storage only 


Digitized voice 


Limited by storage only 
Limited by storage only 


Phone-based
transactions, voice mail,
data retrieval 


$28,000 (4 channels) 
$36,000 (8 channels) 
Contact vendor 


Votan 
Terminal 


Voice response; store- 
and-forward 
Freestanding voice 
terminal 


1 voice channel 


Voice transmitted over 
phone; touch tone 


TMS 320, custom VLSI, 


Voice, prompting 


Natural sentences 
Limited by storage only 


Digitized voice 


Limited by storage only 
Limited by storage only 


Phone-based
transactions, voice mail,
data retrieval 


$3,000-$4,000 


Contact vendor 


Votrax LVM 
Business Communicator 
Voice response 
Freestanding system,
subsystem, remote
system


4-64; increments of 4 
lines 
Touch-tone input;
P.O.S. terminals, low
speed terminals


Proprietary 


Voice/Tone/FSK 


Words, phrases 
4,000+ words/messages 


Digitized human voice 


240-3,840 
80-1,280 


Banking and order entry,
credit verification,
data entry, security


$7,340 and up
Contact vendor 
Contact vendor 
Yes 

30 days 
Votrax 


Votrax/3rd party 




Vynet Corporation 
V 100 Series 


Voice response; voice 
store-and-forward 
Board level 


1-2 


Touch-tone input 


IBM PC, XT 
Proprietary 


Prompts 


Words, phrases 
Unlimited 


LPC or ADPCM 


Auto voice; response 
to telephone caller 


$444-$1988
Contact vendor 


Contact vendor 


Vynet Corporation 
V 1000/2000 Series


Voice response; voice 
store-and-forward 
Subsystem or remote 
system connected to 
host 


2-8 


Touch-tone; touch tone 
and telecommunications 
network 

IBM PC, XT; proprietary 
Proprietary 


Prompts 


Words, phrases 


Auto voice; response 
to telephone caller 


$7,500-$20,000 
Contact vendor 


Contact vendor 


Wang Labs 
DVX 
Voice store-and-forward


Freestanding system; 
connect to PABX 


4-16 


Touch-tone input 


Max. 6-min. msg. 


Digitized voice 


360 to 3,140/day 


General communications, 
sales/mktg., banking, 
remote input 


$90,250 to $186,250 
$140 license fee 
No 


60-90 days 